Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text
نویسندگان
چکیده
A novel method to automatically associate ontological concepts to their realisations in texts is presented. The method has been developed in the context of the Papyrus project to annotate texts and audio transcripts with a set of relevant concepts from the Papyrus News Ontology. To avoid strong dependency on a specific ontology, the annotation process starts by performing a Wikipedia-based annotation of news items: the most relevant keywords are detected and the Wikipedia pages that best describe their actual meaning are identified. In a later step this annotation is translated into an Ontology-based one: keywords are connected to the most appropriate ontology classes on the basis of a relatedness measure that relies on Wikipedia knowledge. Wikipedia-annotation provides a domain independent abstraction layer that simplify the adaptation of the approach to other domains and ontologies. Evaluation has been performed on a set of manually annotated news, resulting in 58% F1 score for relevant Wikipedia pages and 64% for relevant ontology concepts identification.
منابع مشابه
A Resource-Poor Approach for Linking Ontology Classes to Wikipedia Articles
The applicability of ontologies for natural language processing depends on the ability to link ontological concepts and relations to their realisations in texts. We present a general, resource-poor account to create such a linking automatically by extracting Wikipedia articles corresponding to ontology classes. We evaluate our approach in an experiment with the Music Ontology. We consider linki...
متن کاملDBpedia based Ontological Concepts Driven Information Extraction from Unstructured Text
In this paper a knowledge base concept driven named entity recognition (NER) approach is presented. The technique is used for information extraction from news articles and linking it with background concepts in knowledge base. The work specifically focuses on extracting entity mentions from unstructured articles. The extraction of entity mentions from articles is based on the existing concepts ...
متن کاملEffective Ontology Learning : Concepts' Hierarchy Building using Plain Text Wikipedia
Ontologies stand in the heart of the Semantic Web. Nevertheless, heavyweight or formal ontologies’ engineering is being commonly judged to be a tough exercise which requires time and heavy costs. Ontology Learning is thus a solution for this exigency and an approach for the ‘knowledge acquisition bottleneck’. Since texts are massively available everywhere, making up of experts’ knowledge and th...
متن کاملExploiting WordNet for Wikipedia-Based Named Entity Disambiguation
Entity disambiguation is an important problem in semantic analysis and natural language processing. In this paper, we propose an approach to employ features of the WordNet ontology in the task of disambiguating named entities to Wikipedia. Methods of enriching text with synonymous relations of words are explored. An analysis of the results from our experiments shows that the accuracy of the dis...
متن کاملLightweight domain ontology learning from texts: graph theory-based approach using Wikipedia
Ontology Engineering is the backbone of the Semantic Web. However, the construction of formal ontologies is a tough exercise which requires time and heavy costs. Ontology Learning is thus a solution for this requirement. Since texts are massively available everywhere, making up of experts’ knowledge and their know-how, it is of great value to capture the knowledge existing within such texts. Ou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010